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Abstract — Fountain codes are rateless erasure-correcting codes, 
i.e., an essentially infinite stream of encoded packets can be 
generated from a finite set of data packets. Several fountain 
codes fiave been proposed recently to minimize overliead, many 
of which involve modifications of the Luby transform (LT) 
code. These fountain codes, like the LT code, have the implicit 
assumption that the probability distribution is fixed throughout 
the encoding process. In this paper, we will use the theory 
of posets to show that this assumption is unnecessary, and by 
dropping it, we can achieve overhead reduction by as much as 
64% lower than LT codes. We also present the fundamental 
theory of probability distribution designs for fountain codes with 
non-constant probability distributions that minimize overhead. 

I. Introduction 

Since the introduction of the Luby transform (LT) code 
the first practical realization of a fountain code (|2|, other 
fountain code designs have surfaced. Examples include the 
raptor code |3|, which involves precoding an LT code with an 
outer erasure-correcting code, the real-time oblivious code 14J, 
which has a low memory requirement but which extensively 
uses a feedback channel and has a larger overhead than the LT 
code JS] Fig. 3], the systematic LT code [61, which yields low 
overhead using soft decoding, and the reconfigurable rateless 
code Q, which varies block length and encoding strategy, but 
which relies on frequent feedback so as to achieve overhead 
reduction. These new designs employ modifications of LT 
codes or rely on a feedback channel to minimize overhead, so 
in the absence of a feedback channel, existing fountain codes 
have the implicit assumption that the probability distribution 
is fixed throughout the generation of the output symbols. 

In this paper, we will show that by dropping this assumption, 
we can achieve significant overhead reduction while maintain- 
ing the same encoding and decoding complexities. In contrast 
to current fountain codes having fixed probability distributions, 
we consider fountain codes with non-constant probability 
distributions, without the assumption of any feedback channel, 
although feedback for acknowledgement is still required. 

Arguments in this paper assume a familiarity with both 
probability theory (see, e.g., |8|) and the theory of posets (see, 
e.g., 19] Ch. 3]). For any poset P and any a,b eP, denote a<b 
to mean b covers a, and denote a ^ 5 to mean a <b or a — b. 
For any n E N, denote the poset [n] as the set {1, . . . , n} with 
the usual order. For any matrix M, denote M{i,j) and 
to be the (z, j)-th entry and transpose of M respectively. If M 



has real entries, then ||M|jj^ denotes the sum of the absolute 
values of all entries in M. 

Sections HI] and |lll| provide a rigorous mathematical frame- 
work and the necessary tools, so that in Section IIVI we can 
prove that optimal codes have varying probability distributions 
and give an explicit criterion for optimal code designs. 

II. Fountain Code as a Stochastic Process 

Let C be a fountain code with k input symbols wi, . . . 
in ¥2, and denote W as the £xk matrix [wi, . . . , Wk]- For any 
probability distribution V on F|, denote as the proba- 

bility that vector v is chosen. Output symbols are generated 
by the map v t-^ Wv, and we assume the information of v is 
also transmitted, which in practice can be done using a header 
packet or via some time-synchronization between the source 
and destination [3|, i.e. we treat the transmission of output 
symbols as the transmission of column vectors in F2. 

For each n S N, denote A^„ as the set of all kxn matrices 
over F2. Denote = UneN-'^"' denote U as the collection 
of all subspaces of Fj, and denote the poset as the set U 
ordered by set inclusion. For each re {0,1,..., fc}, denote Ur 
as the collection of all r-dimensional subspaces of F|, denote 
Jr = \Ur\, and denote Kr = X^iLr ^l^o, for any matrix 
M E M., denote u{M) as the column space of M. 

Definition. A probability distribution sequence (abbreviated: 
p.d.s.) is a sequence {I?t}tgN such that Vt is a probability 
distribution on F2 for each t g N. If = 2? for every < e N, 
then we say this p.d.s. is constant, and by abuse of notation, 
we breviate this constant sequence simply as V. ■ 

Definition. A stream of output symbols with associated p.d.s. 
{Vf^ten is a sequence of independent discrete random vari- 
ables {XfjfgN such that for each t G N, we have 

Xt:(F^,2^^A)^(F^,2^=') (1) 

given by the mapping v ^ v. A {k, {Vt} tenj-fountain code is 
a set of k input symbols {wi, . . . , wj.} together with a stream 
of output symbols with associated p.d.s. {2?t}tgN- The special 
case of a fountain code with constant p.d.s. © is a (fc, I?)- 
fountain code, which coincides with the notation in Q. ■ 
For a (fc, {2?t}tgN)-fountain code over an erasure channel 
with erasure probability e, we can treat erasures as zero vectors 
transmitted, hence the p.d.s. at the destination is effectively 
{D*]tm, where each 2?* is given by 2?* (w) = (l-e)2?t(t/) for 



any non-zero v G Fj, and (0) = £+(1— e)2?(0). This means 
output symbols generated according to {Vt^tefu that may 
be erased with erasure probabilty e, are equivalent to output 
symbols generated according to {Vl^tefi that experience no 
erasures. Essentially, we have transformed the problem of a 
channel with erasures to the problem of a channel with no 
erasures, but with the restriction Vl (0) > e for every i e N. 

Definition. Let {XtjtgN be a stream of output symbols with 
associated p.d.s. {Vt^ten- For each t G N, denote at as the 
probability distribution on M.t given by 



(2) 



which equals Pr(Xi ■ ■ ■ Xt = [vi, . . . ,vt])- Define the discrete 
random variable Gt = Xi ■ ■ ■ Xt as the concatenation of the 
first t random variables in the sequence {Xt}tefi^ so that 

Gt:iMt,2-^%at)^iMt,2-^^) (3) 
given by the map [vi, . . . ,vt] ^ [vi, . . . , vt]- We say {GfjteN 
is a generator matrix sequence with associated p.d.s. {'Dt}teN- 
Each random variable Gt is called a generator matrix. 

Also, let f : M ^ U he the map M i-> u{M). For each 
< G N, define the discrete random variable 

Ft : (A^t,2^',at) ^ (^,2") (4) 
by the relation Yt — f{Gt)- We call {YfltgN ^ generator 
subspace sequence with associated p.d.s. {"Dt^tefi- * 

It is well-known in combinatorics that U- is a (finite) 
complete modular lattice and hence a graded poset, with the 
dimension of the subspace as the rank function of this poset. 
In particular, the modularity of U- just means 

dim(L/i) + dim(C/2) = dim([/i A U2) + dim(C/i V U2) (5) 

for all Ui,U2 G W^. Since K„ = w(G„) = u{Xi---Xn) 
is by definition a random variable representing the span of 
the columns i.e. the smallest subspace in U 

containing subspaces u{Xi), . . . , u{Xn), we have the identity 



r„ = V uix,) 



(6) 



for all n G N. The next result then easily follows. 



Proposition 1. The generator subspace sequence forms a 
Markov chain. 

Proof: Choose any n G N, and let J/i , . . . , C/„ be arbitrary 
instances of the random variables Yi, . . . ,Yn respectively, such 
that Pr(r„ = C/„,r„-i -[/„-!,..., Yi - C/i) > 0. Next, 
choose an arbitrary U' G U. From (|6]l, we have 

Yn+i = (\J^ uiX,)^ V u(X„+i) = K„ V u(X„+i), (7) 

so since Yi, . . . , y„ depend on Xi, . . . , X^, and since X„+i 
is by definition independent of Xi, . . . , Xn, we get 

Pr(y„+i - [/'|y„ = [/„,..., Yi = c/i) 

= Pr(y„ V w(X„+i) - U'\Yn -{/„,..., Yi = C/i) 
= Pr(Y„ V u{Xn+i) = C/'|Y„ = [/„) 

= Pr(Y„+i = [/'|Y„ = [/„). (8) 
Consequently, {YfjfgN forms a Markov chain. □ 



Definition. For any U,U' and any probability distribution 
V on Fj, we define 

'J2v{v), if U<U'; 

veu'\u 

; V{v), if [/' = U; 



l3v{U,U') 



veu 



0, 



otherwise. 



Lemma 2. Let {YtjtgN be a generator subspace sequence 
with associated p.d.s. {Vtjtefi- If U and n G N such that 
Pr(Y„ = [/) > 0, then for any U' G U, we have 

Pr(Y„+i = U'\Yn - [/) = /3p(C/, {/')■ (9) 

Proof: Since Pr(Y„ = L/) > 0, there exists some M G 
A1„, with CT„(A/) > 0, such that u{M) = U. For any arbitrary 

V e¥^, it follows from © that 

u([M,t;l) = u{M) V u(i7) = V w(i7). (10) 

Since dim{u{v)) < 1, we have (|5]l implies 

dim(M([A-/,iS])) = dim(L/ Vu(i/)) < dim(C/) + 1, (11) 

hence [/ sS u{[M,v\), and Pr(Y„+i = ;7'|Y„ = [/) ^ 
implies J7 ^ [/'. In particular, U < u{[M,v\) if and only if 

V G u{[M,v])\U, and u([M,i;]) = [/ if and only if v e U. 
Thus, by considering: I) U < U'; and 2) U' = J7, we sum up 
the probabilities of the possible vectors and get (|9]l. □ 

III. Fountain Matrix 

Combining Proposition [T| and Lemma|2] we know {YfjtgN 
is a stationary Markov chain if the corresponding p.d.s. 
{2?t}tgN is constant, which then allows us to use its transition 
matrix to compute transitional probabilities. In this section, 
our main goal is to construct the fountain matrix, which is an 
analogue of the transition matrix in the general non-constant 
p.d.s. case and an important tool for p.d.s. design analysis. 

Definition. Let P be an arbitrary finite poset with n elements. 
Recall that a refinement of P is another poset F' such that 
P = P' as sets, and such that for any y G P, we have x < y 
in P implies x < y in P'. Let P* be any refinement of P 
such that P* is totally orderecH and let (/) : P* — ^ [n] be the 
(unique) poset isomorphism corresponding to this refinement. 
We say (p is a total- refinement map o/P, and we say P* is the 
refinement corresponding to (p. ■ 

Definition. Let be a total-refinement map ofW^.Letn G N, 
and let 2?, Pi, ... , Vn be probability distributions on F2'. We 
define the {V, (f))-fountain matrix as the matrix T given by 

T{i,j)^Pv{<ly-'{i). <{>-'{]))■ (12) 
Let Ti be the (2?^, 0)-fountain matrix for each i G [n], 
and denote F„ as the matrix product Fn = Ti • ■ • r„. We 
call Fn the (p-fountain product corresponding to the n-tuple 
(2?i, . . . , 2?„). When n, </> and the corresponding probability 
distributions are not important, we simply say T is a fountain 
matrix and Fn is a fountain product. ■ 

'Such a refinement is possible, for example by considering the Hasse 
diagram of P and refining P by imposing additional cover relations from 
left to right. See |9 Ch. 3] for more details. 



Definition. Let {I?t}tgN be a p.d.s., and let cf) be any total- 
refinement map of For each n £ N, let Tn be the 
(2?„, 0)-fountain matrix, and let F„ be the ^-fountain product 
corresponding to . . . , 2?„). We then refer to {TfjfgN and 
{^FijtgN as the fountain matrix sequence and fountain product 
sequence respectively, each corresponding to {{'Dt]tefi:4>)- ■ 

Definition. Let n e N, let F be a (/)-fountain product 
corresponding to (Pi, . . . , 2?„), and let r, r' G {0, 1, ... , k). 
Let U* be the refinement corresponding to 0, let U* be the 
induced subposet of U* corresponding to set Ur, and let 
(pr : U* ^ [Jr] be the natural (unique) poset isomorphism. 
Define U*, and 1/)^' analogously. Define the {r,r')-fountain 
block of F as the submatrix of F corresponding to the Jr rows 
in {(f){U) : U e Ur} and the Jr' columns in : U E Ur'}. 

We say (pr is the r-th sub-refinement map of F. ■ 
Denoting the (r, r')-fountain block as Ar.r', we note that 

Aryiij) = (13) 

In particular, if F is the (2?, 0)-fountain matrix T, then 

Ar,r'{i,j) = r(0(0-n*)),0(<^;/(j))) = M^rH^),<P;'ij))- 

(14) 

Also, since dim is the rank function of the graded poset 
the set {(j>{U) : U £ Ur} C [K] is invariant over all total- 
refinement maps (j) of U-, thus we have: 

||Aj.r'||i is independent of the choice of (j). (15) 

Theorem 3. Let T be a (2?, -fountain matrix. Let r,r' G 
{0,1,..., fc} be arbitrarily chosen, and denote Ar.r' as the 
(r, r')-fountain block of T. We have the following properties: 

(i) Ar^r' is a zero matrix if r' < r or r' > r + 1. 

(ii) Ar^r is a diagonal matrix, with 

Ar,r{m,m) = V{v) (16) 

for each m G [Jr]- 

(iii) The row sum of each row in T is 1. 

Proof By definition, ([/,[/') ^ impHes U s5 f/'. 
Since U < U' implies rank(f/') = rank(C/) + 1, we get 
I3-d{U, U') ^ impHes U' = U ov rank([/') = rank(t/) + 1, 
thus proving property [(1)] as well as the diagonality of Ar^r 
for every r G {0, 1, ... , k}. Using (fT4l l. we can compute the 
diagonal entries explicitly, and property |(ii)| easily follows. 
Choose any U G Z//. By definition, row 4>{U) has row sum 

Mu, u') = Mu, u)+Y Mu, u') 

U'eu u'eu 

u<u' 

veu u'GUveu'\u 
u<u' 

We obviously have J2v&'' ■^(^) = 1' hence to prove property 
|(iii)[ it suffices to show that all the summands in the sum in 
([TtT i are distinct, and that every vector v G F| corresponds to 
exactly one such summand 'D{v) in ( fTTj l. For brevity, denote 
S as the partial sum X^uet/ ■^(^)' each U' £ U 

covering U, denote Sjj' as the partial sum J2veU'\u'^(^y 
Since U fl {U'\U) = 0, the summands in 5 are distinct from 
the summands in Sjj' for all U' covering U. 



Now, suppose U[,U2 £ U both cover U. If there is some 
u e ¥2X11 such that u e U{r\U2, then u impHes both 
U < UVu{u) < U{ and U < Uyu{u) < U2 in the poset 
which forces the equality U[ = U2 as U[,U2 both cover U. 
This means any pair of distinct partial sums in {Sjj' '■ U' G 
U, U <U'} must have distinct summands, thus every summand 
in ( [TtI i appears exactly once. Finally, for any ?; G F2, we have 
w G C/ V u{v). Since [/ ^ [/ V u{v), it follows that V{v) is a 
summand in ( fTTI l. therefore proving property |(iii)| □ 

Lemma 4. Let {YtltgN be a generator subspace sequence with 
associated p.d.s. {T>t}t£ti, let be a total-refinement map of 
U-, and let {i^fjtgN be the corresponding fountain product 
sequence. Then for any n G N and any U G U, we have 

Pr(r„ = C/) = F„(l,0(;7)). (18) 
Proof: This is analogous to the computation of marginal 
probabilities via the Chapman-Kolmogorov equations (see, 
e.g., im, ifTOl ). and the consideration of the product of transi- 
tion matrices of a non-homogenous Markov chain lITOl . □ 

Corollary 5. Let {GtjteN be a generator matrix sequence 
with associated p.d.s. {PtjtgN- Let cj) he any total-refinement 
map of U-, and let {Ftj^gN be the corresponding fountain 
product sequence. For any n G N and any arbitrary r G 
{0, 1, ... , fc}, denote Aq"^^ as the (0, r)-fountain block of F„. 
Then for any non-empty subset S of {0, 1, . . . , k}, we have 

Pr(rank(G„) eS) = Y, Hi' (19) 

res 

and this value is independent of the choice of 0. 

Proof: From (fTsl l, J2res ll^o"r Hi independent of the 
choice of (j>. The rest follows easily from Lemma |4] □ 
Using the same notations as above, we define 

ar,niT^i,...,V„)=J2\\A'o"j\\i (20) 

j=r 

for n G N, r G {0, 1, . . . , k}. If the context is clear (i.e. when 
{'Dt}t£fi and e are given), then we breviate ar.ni'D*, ■ ■ ■ , T^n) 
as ar.n- Corollary |5] then tells us Or^n is the probability of the 
generator matrix G„ having rank > r at the destination. 

IV. Choosing the Next Probability Distribution 

Let G N, and suppose we are given N probability 
distributions X'i,...,X'jv on Fj, which form the first N 
probability distributions of some p.d.s. design at the source. 
To minimize overhead, we then need to decide on the next 
distribution 2?Ar+i in the p.d.s. so that ar^N+i is maximized. 

Definition. Let n G N, and let 2?i , . . . , 2?„ be any n probabil- 
ity distributions on F§. Let (p be any total-refinement map of 
U-, and let F he the 0-fountain product corresponding to the 
n-tuple {Vi, . . . , Vn). For each t, i, j G {0, 1, . . . , k}, denote 
ipt as the i-th sub-refinement map of F, and denote Aij as 
the -fountain block of F. For any r G {0, 1, . . . , fc} and 
any non-zero iJ G F|, we define 

7,,„(z/|2?i,...,2?„) = ^ [Ao,.(l,0.([/))]. (21) 

u-.ueUr- 
veu 

By default, set the empty sum 7o,n(''7|^5i, ■ • ■ , 'Dn) to be 0. 



Define 7™(2?i, . • . , AO [abbreviated: 7^?^^" if context is 

clear] as the minimum value of 'jr.n{v\'Di, . . . , 2?„), where v 

varies over all non-zero vectors in Fj. Also, define 

rf^{Vi,...,V„) = argmin 7,,„(w|Pi,...,P„), (22) 
ffeF^\{o} g 

Theorem 6. Let n £ N and let Vi, . . . be probability 
distributions on Fj. Let (phe a total-refinement map of let 
F be the 0-fountain product corresponding to (2?i, . . . , Vn), 
and for each i,j € {0,1,..., fc}, denote A^.j as the 
fountain block of F. For any t e {0, 1, . . . , fc}, u e F|\{0}, 
we breviate 'yt,n{v\Vi, . . . .Vn) as 7t^„(t7). Then, for any r G 
[fc] and any probability distribution T) on F2, we have 

ar,„+i(I'i, . • . ,2?„,2?) = Q;r,„(I?i, . . . + C, (23) 
where C is given by 
C - (1 - P(0)) Po,r-i II 1 - ( ^ 7r-i,„(i7)2?(i?)) . (24) 

iTGF|\{0} 

Proof: See appendix. □ 

Corollary 7. Let n e N and let Pi , . . . , 2?„ be probability 
distributions on F^ with erasure probability e. Let </> be a 
total-refinement map of U-, and denote Aqj as the (0,j)- 
fountain block of the 0-fountain product corresponding to 
(2?i, . . . , D*). Then, for any r G [k] and any probability 
distribution V on Fj satisfying V* (0) ~ e' > e, we have 

a,,„+i(2?J,...,2?:,P*) <a,,„(2?J,...,P:) + C', (25) 
where C" is a non-negative value given by 

C' = {1- £')(ll^o,.-i|li - 7ri„(2?J , • • • , K))- (26) 
Breviating r^i\JVl,...,V*^) as L™"} „, equality holds in 
(l25T l if and only if V satisfies 

^P* («) = !-£'. (27) 

Proof: By Theorem |6] it suffices to prove that 

' {l-e')Y;^l^<J2hr-iAv)'D*iv)] (28) 
ffeF|\{o} 

Note that by the definition of 'yr-i,n{v), we have 



Avg. Overhead: 37.4564% Avg. Overhead: 30.4930% Avg. Overhead: 24.9533% 



0<7 



r — l,n 



< 



7r-l,n(w) < ||Ao,r-l||i 



(29) 



for all V e F§\{0}. Treating {V*{v) : v e ¥^,v ^ 0} as 2*^-1 
non-negative real-valued variables subject to the constraint 



V*{v) 



1 



(30) 



iTeFj\{0} 

we see that (|29] | and (l30l l imply (l28l l. The lower bound in 
is attained only if 2?*(w) ^ implies v e F™" „, hence 
yields (|27li. Finally, C" > follows trivially from (|29ll. □ 

Corollary 8. Let n e N, and let n probability distributions 
Vi, . . . , Vn on Fj corresponding to erasure probability e be 
given. Then for any r G [fc], and any probability distribution T) 
with erasure probability e, we have Q;r,n+i(2?i, • . • , Z^j^, 2?*) 
is maximized if and only if V satisfies the condition: 

V{v) ^O^ve T'^ll^iVl VI). (31) 

Proof: By Corollary |7l since Pi , . . . , Vn are given im- 
plies (Po,._i||i - 7ri,„(2?i*, • • • ,Z?;j) in M is fixed, the 
upper bound in ( l25b is independent of the choice of P* {v) for 
non-zero t7. So to maximize C in ( |26] |, we need to minimize 



LT Code 



T . — ^l^liiiliiiiw 



300 



300 





z 0' — ' 0^ 

250 350 450 550 250 350 450 550 250 350 450 550 
Number of Output Symbols Required for Complete BP Decoding 

Avg. Overhead: 33.8239% Avg. Overhead: 23.1592% Avg. Overhead: 9.0791% 
£200 , 300 1000 



Designed B 
Code ■slOO 



250 350 450 550 250 350 450 550 250 350 450 550 
Number of Output Symbols Required for Complete BP Decoding 

Fig. 1 . Compaiison between LT code and designed code, each with A; = 250 
input symbols. Erasure probabilities from left to right: 10%, 5%, 1%. 



!?*(()), i.e. set 2?* ((J) = e, and the assertion easily follows. □ 
Corollary [8] serves as a criterion for choosing the next prob- 
ability distribution 2? in a p.d.s. with some initial probability 
distributions already determined. In particular, the design of 
the next distribution V is independent of e. Starting with any 
2?i satisfying X'i(O) = 0, the recursive use of Corollary [8] then 
serves as a criterion for optimal code designs. 

Note also that for p.d.s. {Vf^ten, if Z?„+i is fixed, then 
by Corollary |7] a^.n+i is maximized if 7™" „ is minimized. 
So for {PflfeN to be optimal, we must have 7""" 1 = 0, 
forcing 7""" 2=0, hence we get Vi ^ T>2- Consequently, 
optimal codes necessarily have non-constant p.d.s. designs. In 
particular, such designs are 'universal', i.e. they are optimal 
for erasure channels with arbitrary erasure probabilities, so a 
constant p.d.s. is not required for the 'universality' property. 

Following notations in yj, the robust soliton distribution 
depends on parameters fc, c, 8, which we denote as [Lk.c,&- Now, 
consider the (fc, {2?t}fgN)-fountain code C such that 

1 



Ul 



U2 



Us 



1 



(32) 



where for each t £ [fc — 1] and some initial dt G [/c — t] 
chosen according to degree distribution ^J,k-t,c,s, we have 
ut G Fj"* is a vector of Hamming weight dt — 1 that is chosen 
uniformly at random. This description uniquely determines 
T>i, . . . jVk, which are clearly distinct. Furthermore, we fix 
Vk+i, ■ ■ ■ , Pfc+^ij to be probability distributions induced by 
the degree distribution Hk,c,5^ and we define subsequent prob- 
ability distributions recursively: X'„_|_|^3fcj = for all n G N. 
Using Corollary |8] we can check that ar.n(Pi, • ■ • , P,*) is 
maximized for all r,n E [k] and any erasure probability e, 
hence the first k probability distributions of C are optimal. 

Using belief propagation (BP) decoding and parameters k = 
250, c — 0.03, 6 — 0.5, we run simulations for the LT code 
and C. Each histogram in Fig. [T] represents 10000 iterations, 
and we see that C has a lower overhead than the LT code. 



e.g. 9.70%, 24.05%, 63.62% lower for e = 10%, 5%, 1% re- 
spectively. Note that other code designs with further overhead 
reduction are possible and will be elaborated in future work. 

V. Conclusion 

In this paper, we applied probability theory and the theory of 
posets, and showed that optimal fountain codes must have non- 
constant p.d.s. designs. A criterion for optimal code designs 
has also been derived. Simulations show an overhead reduction 
when the probability distributions are varied, hence our theory 
of optimal p.d.s. designs has immense significance for fountain 
codes, promising codes with low overhead. 

Appendix 
Proof of Theorem[6] 
Denote e — X'(O). For each i,j G {0, 1, ... , k}, denote 
(f)t as the t-th sub-refinement map of F, and denote A^^,. and 
A^r,j as the concatenations of matrices: 

^i,^r = [A-i^r, • ■ • , (33) 

-^^r,j = [AJj^ ^r+lj"i ■ • ■ 1 ■^Jj]'^ ■ (34) 

Also, we denote A^r.^r by the two equivalent expressions 

] (35) 

= [^r^^ri ^r+l,^ri • • • I ^fc.^r]'^- (36) 

We can then partition F into {r + 1)^ blocks so that 

^0,0 ^0,1 • ■ • ^0,r-l Ag^^r 

Alfl Ai^i ■ ■ ■ Ai^r-1 Ai^^r 

F^ : : : : . (37) 

• • • Ar-l^r-1 ^r-l,^r 
A^r,Q A^r.l ' ' ' A'^r.r—l A^r,^r 

Next, denote T as the (2?, 0)-fountain matrix, and denote 
A'^ J as the (i, j)-fountain block of T. Defining A' A'^^ , 
and A'-^^ >^ analogously, we can partition T into (r + 1) 
blocks analogous to dSTl l. The product FT is the (/)-fountain 
product corresponding to (2?i, . , V), thus we get 

(38)' 

Using Theorem |3Ii)| (l38l l reduces to 

Olr, n+l{T^lT ■■ ^T^mT^) = ll^O.r-l^r-l II i + ll^O.^Sr^'^r II 

(39) 

Denote B,. and B'^ for Ao,r-i^'r_i.^j. and ^o^^r^'^^.^r 
respectively. Then for each j e [K^], we have 

Br{l,]) - ^[^o,r-l(l,»)][A;_i,^,(z,j)], (40) 
i=l 

hence taking the norm, we get 

Jr-l 

j=l j=l i=l 

(41) 



Theorem [ 3ti)| tells us that yl'->j, ^ is a zero matrix for 
each t e {0,...,?' — 1}, so Theorem | 3tiii)| implies 
Z]jlri[^'^r,^r(*'i)] — 1 fo'" ^^^^ * S i^r], thus (|42] | rcduces 

l|5;ill=E[^0^^-(l'*)] = ll^O.^r-lli =«.,„(Pl, . . . ,P,{)t3) 

Similarly, Theorem |3ti)| yields ^ is a zero matrix for 
each j G {0, . . . , r — 2} (if any). Since Theorem I 31ii)| tells us 
A[._-^ is a diagonal matrix, with 

K-i.rS.^) ^Y.T^iv) (44) 

for each i G [Jr-i], it then follows from Theorem I 31iii)| that 



^[A;_l^^,(^,J)] = l- f^P(z;)j (45) 
for each i G [Jr-i]- Also, for any t G {0, 1, ... , fc}, we have 



E 



iTeF*\{0} U:U£Ut 



:Ao^*(1,0,(C/))]^2?(«) 

veu 

[AoAhMu))]]m 



(46) 



- E 

iTeFj\{0} 

Since Wr-i = {(t)'f^^_^^{i) : i G [J^-i]}, we have (ITTT i yields 

lli^.lll = E [[^0,.-l(l, (l - E ^(«^)^ ^ 

- ^[(l-£)[Ao 



-E 



i(l,0,_i(C/))]] 

[[Ao,,-l (1,0.-1 ([/))] 

e)Po,r-i|li 



= 5] [Ao,.„i(l,*)] 5][A;_l^^,(^,J)] 
Similarly, we also get 

ii5;iii = E [A,MlMY.\A'^r,^,{^,^) 

i=i L S=i 



(42) 



veu 

{I - e)\\Ao,r-i\\, - (J2 -fr-iAv)V{v)y (47) 

t7GFj\{0} 

where the last equality follows from ( |46] |. Finally, combining 
(|39] l. (l43T l and (EJ, we get (|23]l as claimed. 
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